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Abstract — This work analyzes the high-SNR asymptotic er- 
ror performance of outage-limited communications with fading, 
where the number of bits that arrive at the transmitter during 
any time slot is random but the delivery of bits at the receiver 
must adhere to a strict delay limitation. Specifically, bit errors 
are caused by erroneous decoding at the receiver or violation of 
the strict delay constraint. Under certain scaling of the statistics 
of the bit-arrival process with SNR, this paper shows that the 
optimal decay behavior of the asymptotic total probability of 
bit error depends on how fast the burstiness of the source 
scales down with SNR. If the source burstiness scales down too 
slowly, the total probability of error is asymptotically dominated 
by delay-violation events. On the other hand, if the source 
burstiness scales down too quickly, the total probability of error 
is asymptotically dominated by channel-error events. However, 
at the proper scaling, where the burstiness scales linearly with 
yiogSNk optimal coding duration and transmission 

rate, the occurrences of channel errors and delay-violation errors 
are asymptotically balanced. In this latter case, the optimal 
exponent of the total probability of error reveals a tradeoff that 
addresses the question of how much of the allowable time and rate 
should be used for gaining reliability over the channel and how 
much for accommodating the burstiness with delay constraints. 



I. Introduction 

This work analyzes the high signal-to-noise-ratio (SNR) 
performance of outage-limited communications where the in- 
formation to be communicated is delay-limited and where the 
information arrives at the transmitter in a stochastic manner 
We consider the following setting (Figure [T]) in our study: 

• A random number of bits arrive at the transmitter during 
any given timeslot. Bits accumulate in an infinite buffer 
while waiting for their turn to be bunched into codewords 
and transmitted under a first-come, first-transmit policy. 
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• There is no feedback to the transmitter; retransmission of 
the bits in error is not considered. 

• Communication over the fading channel is outage-limited 
( [1], [2]), where the transmitter is unaware of the instan- 
taneous channel state and, as a consequence, operates at 
a fixed transmission rate, R. During a deep fade (also 
known as an outage), the channel seen by the decoder 
is too weak to allow recovery of the data content from 
the transmitted signal. Characteristic settings are those of 
MIMO and cooperative outage-limited communications. 

• Coding takes place in blocks where each codeword spans 
over a fixed and finite integral number, T, of timeslots. 
Each codeword has an information content of RT bits. 
In addition, coding is "fully-diverse," i.e., the decoding 
at the receiver takes place only at the end of the coding 
block. 

• The delay bound, D, is a maximum allowable time 
duration from the moment a bit arrives at the transmitter 
until the moment it is decoded at the receiver. The delay 
experienced by a bit is the sum of the time spent waiting 
in the buffer and the time spent in the block decoding 
process. Note that the waiting time in the buffer is random 
due to the stochastic arrival process. 

• A bit is declared in error either when it is decoded 
incorrectly at the decoder, or when it violates the delay 
bound. 

For the above setting, we are interested in the high-SNR 
asymptotic total probability of bit error Note that for a given 
transmission rate, R, and a coding block duration, T, there 
exists a tradeoff between the probabilities of decoding error 
versus the delay violation. We expect that longer coding blocks 
allow the encoded bits to be transmitted over more fading 
realizations and hence, achieve higher diversity and fewer 
decoding errors. However, longer coding blocks cause more 
bits to violate the delay requirement. In other words, one 
intuitively expects that there is an optimal choice of the fixed 
transmission rate, R, and the fixed coding block duration, T, 
for which the total probability of bit error is minimized. The 
goal of this paper is to analytically identify these optimal 
quantities. 

A. Prior Work and Our Contribution 

High demands on the quality of service (QoS), in terms of 
both packet losses and packet delays, have fueled substantial 
research interest in jointly considering channels and queues. 
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Communication of delay-sensitive bits over wireless channels 
has been addressed under various assumptions and settings in 
works such as [3]-[8]. Often, asymptotic approximations are 
employed to enable tractable analysis of the problem. Below 
we detail the existing work with their corresponding settings 
and the relationships to this paper. 

The first group we discuss, [3]-[6], consists of scenarios 
where the current channel state information (CSI) is assumed 
to be known at both the transmitter and receiver. For example, 
in [3] and [4], Berry and Gallager address the tradeoff between 
the minimum average power consumption and the average 
delay (the power-delay tradeoff) over a Markovian fading 
channel with CSI both at the transmitter and the receiver. 
In such a setting, the transmitter dynamically varies power 
(i.e., the rate) in response to the current queue length and 
channel state. In [5], Rajan et al. derive optimal delay-bounded 
schedulers for transmission of constant-rate traffic over finite- 
state fading channels. In [6], Negi and Goel apply the effective 
capacity [9] and error exponent techniques to find the code- 
rate allocation that maximizes the decay rate of the asymptotic 
probability of error for a given asymptotically-large delay 
requirement. Similar to [3] and [4], the proposed dynamic 
code-rate allocation in [6] is in response to the current channel 
fading and is possible by assuming CSI knowledge at the 
transmitter. 

A second group of work (e.g., [7], [8]) focuses on scenarios 
where CSI is unknown to the transmitter but there is a 
mechanism for retransmission of codewords when the channel 
is in outage. As a tradeoff to protection against channel outage, 
this retransmission incurs extra delays to the bits in the buffer. 
In [7], for example, Bettesh and Shamai (Shitz) address the 
problem of minimizing the average delay, under average power 
constraints and fixed transmission rate. They provide asymp- 
totic analysis, under heavy load condition and asymptotically 
large queue length, for the optimal adaptive policies that adjust 
the transmission rate and/or transmission power in response 
to the current queue length at the transmitter. In another 
example, Liu et al. in [8] study the problem of optimal (fixed) 
transmission rate to maximize the decay rate of the probability 
of buffer overflow for on-off channels and Markov-modulated 
arrivals. The channel is considered "off" when outage occurs. 

Although our work uses a similar performance measure to 
[6], namely the decay rate of the asymptotic probability of 
error, it covers the scenarios in which CSI is not available 
to the transmitter (no CSIT) and there is no retransmission. 
In such a setting, the variation of the fading channel is 
combatted via a coding over multiple independent fading 



realizationsQ While this approach improves the transmission 
reliability, its longer coding duration increases the end-to- 
end delay any bit faces, and can potentially increase the 
probability of delay violation. In other words, in the absence 
of CSIT and retransmission, the transmission reliability, as 
well as the delay violation probability, are functions of the 
coding rate and duration. Consequently, our work compliments 
this previous research as it considers the effect of a delay 
violation requirement, in the absence of CSI at the transmitter 
and retransmission, on the operation of the physical layer. 
We consider a fixed transmission rate and code duration, as 
opposed to dynamic policies. 

Since it is difficult to derive the exact relationship between 
the system parameters and the probabilities of channel de- 
coding error and the delay violation, we choose to study 
an asymptotic approximation when the signal-to-noise ratio 
(SNR) is asymptotically high. The first advantage of this 
choice is the availability of an asymptotic high-SNR anal- 
ysis for the channel decoding error probability. This high- 
SNR analysis is known as the diversity-multiplexing-tradeoff 
(DMT) analysis [1] and has received a great deal of attention 
during the past few years. Another advantage of the high-SNR 
analysis is that, for the class of arrival processes we consider in 
this paper, we can derive an asymptotic approximation of the 
delay violation probability that is valid even when the delay 
requirement D is finite and small. This derivation (Lemma |2l) 
is based on a large-deviations result known as the Gartner- 
Ellis theorem (see e.g., [10]) and extends the large deviations 
exponent for a queue with asymptotic number of flows (as 
provided in [11]-[14]) to a queue with batch service discipline. 
Given that the asymptotic expression of the total probability 
of bit error is valid without requiring asymptotically large D, 
it is then meaningful to ask about the optimal coding block 
duration, a question which is not answered in studies with 
asymptotic D (e.g., [3], [4], [6], [15]-[17]). 

We also would like to point out that our work was motivated 
by the work of Holliday and Goldsmith [18] where, under 
a high-SNR asymptotic approximation, the optimal operating 
channel transmission rate for a concatenated source/channel 
system is studied. Following the approach in [18], we study 
a concatenated queue/channel system under a high-SNR ap- 
proximation. 

' For example, the multiple independent fading realizations can be a result of 
fading in multiple channel coherence time intervals (known as time diversity), 
or fading in multiple independent spatial channels, as in MIMO channel 
(spatial diversity), or cooperative relay channel (cooperative diversity). 
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B. Overview of the Results 

This work focuses on the notion of SNR error exponent as 
a measure of performance. Specifically, we are interested in 
finding how the asymptotic total probability of error decays 
with SNR. To keep the problem meaningful, we consider a 
scenario under which the overall traffic loading of the system 
(the ratio between the mean arrival rate and the ergodic 
capacity of the channel) is kept independent of SNR. That is, 
we consider a case where the arrival rate scales with log SNR. 
Note that this scaling of arrival process is necessary to ensure 
a fixed loading and hence a comparable cross-layer interaction 
as SNR scales. 

From the DMT result, we already know that, if the channel 
operates below the channel ergodic capacity, the asymptotic 
probability of channel decoding error decays with SNR. The 
best one can hope for is that the asymptotic total probability of 
error decays exponentially with SNR. For that, the asymptotic 
probability of delay violation needs to decay with SNR. 
Specifically, we consider a class of i.i.d0 arrival processes 
with light tail (i.e., the processes have all moments finite) 
whose burstiness (defined as the ratio of the standard deviation 
over the mean of the number of bits arrived at a timeslot) 
monotonically goes to zero as SNR goes to infinity. We show 
that for all such processes (called smoothly-scaling processes), 
the total probability of error decays. 

The main result of the paper shows that the optimal decay 
behavior of the asymptotic total probability of bit error de- 
pends on how fast the burstiness of the source scales down 
with SNR. If the source burstiness scales down too slowly 
(too quickly), the majority of the errors are due to delay 
violation (channel error), i.e., the total probability of error is 
asymptotically dominated by delay-violation (channel-error) 
events. However, at the proper scaling where the burstiness 
scales linearly with ^^^^^^^^ and with the optimal coding 
duration and transmission rate, the occurrences of channel 
errors and delay-violation errors are asymptotically balanced. 
Equivalently, one can interpret our result, the optimal choice 
of block coding duration and transmission rate, as that which 
balances the channel atypicality (deep fading or outage events) 
and the arrival atypicality (large bursts of arrivals). 

We apply this result to several examples of outage-limited 
communication systems to find the optimal setting of the 
operating parameters. 

C. Outline of the Paper 

The precise models for the coding/channel process and 
the bit-arrival/queue process are described in Section HI] We 
precisely define the scaling of the source process with SNR 
and give a simple example of such source processes. Sec- 
tion |III]provides the asymptotic probability of delay violation. 
The main result of the paper is found in Theorem [T] of 
Section |IV] This theorem provides the optimal asymptotic 

-Note that, since the adopted channel model is not assumed to be i.i.d., 
assuming an i.i.d. arrival process, intuitively, is not consequential: think of 
our chosen time slot as an upperbound for the "coherence time" of the arrival 
process. The i.i.d. source assumption mostly serves to simplify the exposition 
and presentation of results, and does not fundamentally limit the setting. 



decay rate of the total error probability as well as the optimal 
coding duration and transmission rate. Section [V] gives some 
examples to illustrate the utility of TheoremfT] These examples 
consider the question of optimally communicating delay sen- 
sitive packet stream with a compound Poisson traffic profile 
over the following outage-limited channels: SISO Rayleigh 
fast-fading channel, quasi-static cooperative relay channel, 
and quasi-static MIMO channel. Section |VI] concludes the 
paper Appendices include the proofs of various lemmas and 
theorems. 

D. Notations 

We use the following symbols and notations. We use p 
to denote SNR. The notation = for a strictly increasing and 
positive-valued function g represents the equivalence between 

yip) = z{p) and lim = lim ^28^. We define > 

g 

and < in a similar manner. Note that when g is an identity 
function, then = is equivalent to the familiar = notation in the 
DMT analysis [1]. 

We denote the high-SNR approximation of the ergodic 
capacity of AWGN channel by N logp and use N and 
logp interchangeably. The sets Z, N, and Z+ represent the 
set of all, positive, and non-negative integers, respectively. 
In addition, the set T represents the set {1, 2, . . . , [-jj }. 
Flooring and ceiling functions are denoted by [-J and [•], 
respectively. For all a < b, [x]^ = max(a, min(6, a;)) and 
[a;]+ = max(2;,0). We write g{x) = Q{h{x)) to denote 
that the function g scales linearly with the function h, i.e., 
lim < oo and lim < oo. Finally, for any function 
/, we denote its convex conjugate, /*, by 

f*{x)^snp 9x- f{0). (1) 

II. System Model 

As discussed in the introduction, we consider a system 
composed of a bursty and delay-Umited information source, 
concatenated with an infinite buffer and a fading channel, as 
shown in Figure [1] We assume the queue follows a first-come- 
first-serve (FCFS) discipline. The departures out of the queue 
occur according to a block channel coding scheme, while 
the arrivals to the queue follow a stochastic model. If the 
transmission rate is above the instantaneous capacity of the 
channel, an outage event is said to occur where the received 
signal is erroneously decoded. The delay requirement asks 
that each bit of information be decoded at the destination 
within a maximum allowable delay of D time-slots from 
the time it arrives at the buffer Otherwise, the bit will be 
obsolete, discarded, and counted as erroneous. We assume no 
retransmission of unsuccessful transmissions or those bits 
which violate the delay boundOl In the next three subsections, 

^^Note that due to the constant service rate of the queue and the FCFS 
service discipline, any bits arriving at the queue know iininediately whether 
they will exceed their delay constraints, using the knowledge of the cun'ent 
queue length. It seems wise to drop these bits immediately after their arrivals 
to improve the systein performance. However, we do not need to consider 
such method because it has been established (see [14, Theorem 7.10]) that, in 
the asymptotic regime of interest, such method does not improve the exponent 
of the delay violation probability. 
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we describe in detail the models for the channel, the arrival 
process, and the system performance measure. 

A. Channel and Coding Model 

We consider a general fading-channel model, 

y = Hx + w_, 

where x_ is the transmitted vector, H is the channel matrix, y 
is the received signal, and w_ is the noise vector The average 
SNR is defined as [1] 

_ E[||Hx||2] 
E[i|u;||2] ' 

and in the asymptotic scale of interest, it is equivalent to 

p = nM\\ 

Coding takes place over T timeslots, using rate-i?, length-T 
codes that meet the DMT tradeoff dch(?', T) [1], defined as 



dch{r,T) := - lim 

p^OC 



logP,h(r,r,p) 
logp 



(2) 



where Pa^ir^T^p) is the codeword error probability induced 
by the channel, given an optimal code of multiplexing gain 
r, coding block size T timeslotfl and average SNR p. The 
channel multiplexing gain r is related to the transmission rate 
R as (refer to [1]) 

r lim . (3) 

p^oc logp 

That is, the transmission rate R is assumed to scale linearly 
as rlogp. We denote by fmax the maximum value of r, i.e., 
< r < Tmax- This Tmax relates to the ergodic capacity as 

maxp^ I{x; y) 



= E 



H- 



\ogp 



and is the smallest r such that dch{r, T) = 0. 

The DMT tradeoffs have been extensively studied for 
various finite-duration communication schemes (for example, 
see [19], [21]-[24] for MIMO point-to-point communica- 
tions, [25] for multiple access communications, [26], [27] for 
cooperative communications, and [20], [28] for cooperative 
communications with small delay). 

Remark 1: The condition that each bit be transmitted over 
all timeslots in the coding bloclfl, together with the first- 
come first-transmit service discipline, makes it so that every 
T timeslots, the RT oldest bits are instantaneously removeqj 
from the queue and are transmitted over the next T timeslots. 
We assume that it is only at the end of the T timeslots that 
all the RT bits are decoded by the decoder. 

"^For most settings, there exist codes that meet the entire DMT tradeoff in 
minimum delay, independent of channel dimensionality and fading statistics 
[19H21]. 

^Currently, all known minimum-delay DMT optimal codes over any fading 
channel with non-zero coefficients ask that each bit be transmitted over each 
timeslot. 

^If an insufficient number of bits exists in the buffer, null bits are used 
and the rate is maintained. It is easy to show that, in the asymptotic scale of 
interest, the use of null-bits does not incur any change in the SNR exponent 
of the probability of enor. 



Example 1 (Rayleigh Fast-Fading SISO Channel): 
Consider the single-input single-output (SISO) time-selective 
channel with Rayleigh fading coefficients (correlated or 
uncorrected) and with additive white Gaussian noise at the 
receiver. The corresponding channel model over T timeslots 
is given by 

y = dmg{h) x + w, 

where y,h,x, and w are T x 1 vectors and H — diag(/i) is 
a T X T diagonal fading matrix with the fading at the tth 
timeslot, ht, as its {t, t) element. The optimal DMT, given 
optimal signaling, takes the form 

, , logPr{l{x;y\h)<2^^) 



logp 



lim 

p — >oo 



logPr{ULii^ + pM')<p^^) 



logp 



For the fast-fading case where the coherence time is equal to 
one timeslot and the elements of h are Rayleigh i.i.d. random 
variables, the tradeoff takes the form 



deh(r,T) -T(l-r), 



(4) 



and it can be met entirely in T timeslots (see [1]). This SISO 
channel allows for r,„ax = 1- 

Other examples which will be discussed later in Section |V] 
are quasi-static MIMO and cooperative-relay channels. In this 
paper, for simplicity we assume that dch{r,T) is continuous 
on r, decreasing on r, and increasing on T. 

B. Smoothly-Scaling Bit-Arrival Process 

In this subsection, we describe the SNR-scaling of a family 
of arrival processes of interest. The specific choice of SNR- 
scaling for the statistics of the bit-arrival process is such that 
the average traffic load of the system (defined as the ratio 
of the average arrival rate over the ergodic capacity) is kept 
constant, independent of SNR0 This means that scaling in the 
ergodic capacity r^ax log p (= r^aaxN) is matched by scaling 
the average bit-arrival rate as A log p (= \N) as well, for some 
A > 0. Now we are ready to introduce the arrival process of 
interest: The sequence of asymptotically smoothly-scaling bit- 
arrival processes, in which the process becomes "smoother" 
for increasing N. 

Definition 1: Let Q denote a class of functions which 
contains any function g : R+ R+ (called scaling function) 
which is continuous and strictly increasing and whose tail 
behavior is such that 

9ix) 



lim 



oo. 



(5) 



E— too logo; 

Definition 2: (g-smoothly-scaling source) Consider a scal- 
ing function g G G and a family of bit-arrival processes 

^It can be seen that unless the traffic load (average bit arrival rate over the 
channel rate) scales as log(SNR), i.e., linip^oo tj;^ = ^ f™ some fixed 
< £ < oo, the problem is void of cross-layer interactions. Otherwise if 
^ = 0, corresponds to the case where too few bits anive and effectively there 
is no queuing delay. On the other hand, when the traffic load scales much 
faster than log(SNR), i.e., £ = oo, the overall performance is dominated by 
queueing delay, independently of the channel characteristics. 
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(A(^),7V e N), where A^^^ = (A|^\t G Z) denotes an 
i.i.d. sequence of the random numbers Aj^-* of bits that arrive 
at time t with E[A[^''] = XN, for all t. The family of bit- 
arrival processes is said to be g-smoothly-scaling if the limiting 
g-scaled logarithmic moment generating function, defined for 
each G R as 



Km 

N^oo 



log£;[exp(^AW)] 



(6) 



9{N) 

exists as an extended real number in R* := M U {oo} and is 
finite in a neighborhood of the origin, essentially smooth, and 
lower-semicontinuous^ 

Remark 2: It is straight forward to show that A is convex 
and A'(0) = A (see [14, Lemma 1.11]). 

Note that A describes how close the average bit-arrival rate 
is to the asymptotic approximation of the ergodic capacity of 
the channel. For stability purpose and to ensure the existence 
of a stationary distribution, we require that A < rn,ax- Also, 
note that we abuse the notation and denote the arrival process 
by A\.^\ despite its possible dependency on the scaling 
function g. 

1 ) Motivation for Smoothly-Scaling Assumption: The as- 
sumption of g-smoothly-scaling arrival processes allows us to 
find the decay rate of the tail probability of the sequence of 
process {s\^\n which is a sum process defined as 



(AT) 



t 

E 



Af\ tGN; 



since (A^^^j G Z) are i.i.d., S*}^"* is also a g-smoothly scal- 
ing process with the limiting g-scaled log moment generating 
function As^ given as 



lim 



9iN) 



tAie). (7) 



Now, given that the sequence {sl^\ N G N) is g-smoothly- 
scaling, we can use the Gartner-Ellis theorem (see e.g., [10] 
and [14]) to give the following result on the decay rate of 
the tail probability of the sequence. The following proposition 
provides an important basis for the analysis of the asymptotic 
probability of delay violation in Section |III] 

Proposition 1: (Gartner-Ellis theorem for g-smoothly- 
scaling process) Consider g G G and a family of g-smoothly- 
scaling processes {A'^^\N G N) with the limiting g-scaled 



log moment generation function A. Let 
for t G N. Then, for a > At, we have 



(N) 



(N) 



lim 

Af^oo g[N) 



logP 



S. 



(N) 



N 



>a \ = -tA*(a/t), 



(8) 



where A* is the convex conjugate of A. 

Proof: See Appendix H] ■ 

* [14] A function / : M ^ R* is essentially smooth if the interior of its 
effective domain T) = {x : f{x) < oo} is non-empty, if it is differentiable 
in tlie interior of © and if / is steep, whicii means tliat for any sequence 9„ 
wliicli converges to a boundary point of T), then Iim„^oo l/'(^n)l = +cxd. 
/ is lower-semicontinuous if its level sets {x : f{x) < a} are closed for 



2) Asymptotic Characteristic of Smoothly-Scaling Pro- 
cesses: Intuitively, the g-smoothly-scaling arrival processes 
become smoother as SNR increases. This intuition follows 
from (|6]l, which implies that for (^ G R such that K{9) < oo 
and e > 0, there exists A^o such that for N > Nq, 



exp {g{N)A{e) - g{N)e) < E 



exp 



< exp (.g(A^)A(6l) - 



Ai"^ 



Then, if we let ^^(Ar) be a sum of g{N) i.i.d. random variables 
(i.e., Yg^N^ ATi + • • • + Xg^N) with Eie^'"^^] = e^<^^^), we 
have £'[e^('^s("))] = eHe)9iN)^ Therefore, at sufficiendy large 
A^, ^S^^Aj^'' and ^^(Ar) have the same moment generating 
function and hence the same distribution. If we define the 
burstiness of the random variable A[^'' as the (dimensionless) 
ratio of its standard deviation over its meanQ then, using 

std(A<"') 

the above intuition, the burstiness — for large N is 

std( 



approximately equal to 



to 



std(A'i 



XN 



which is reduced 



Hence, the burstiness of A[^'' decays to zero 
In other words, the g-smoothly- 



approximately as , 

scaling arrival processes become smoother as SNR increases. 

3) Examples of Smoothly-Scaling Processes: One of the 
common arrival processes used for traffic modeling is a 
compound Poisson process with exponential packet size, de- 
noted as CPE. For this source, the random number of bits, 
A\^\ arrived at timeslot t, is i.i.d. across time t and is in the 
form of 



A 



(AT) 



(AT) 

t ' 



(9) 



where M^^^ is the random variable corresponding to the 
number of packets that have arrived at the t*'' timeslot, and 
where i^/f corresponds to the random number of bits in the 
i*-^ packet. Mf ^^ are independently drawn from a Poisson 



distribution with mean v{N); and 



(AT) 



1, ■ 



(N) 



are independently drawn from an exponential distribution 
with mean -j^j^ (nats per packet). Note that the assumption 

that E[A\^\ = AA^ forces that ^ = AA^. In addition, 
a larger average packet size -j^^ implies a more bursty 
arrival process^ It is known (see [16]) that the log moment 
generating function of this CPE random variable ^j^"* is 

logi?[e^^^"'] = |^' '^^^""^^ (10) 
I oo, otherwise. 

The following examples illustrate that, depending on the 
scaling of the average packet arrival rate and the average 

'Note that the burstiness definition here is basically the normalized variation 
of the random variable around its typical value (its mean). A more familiar 
definition of traffic burstiness would involve how the traffic are correlated 
with time, i.e., a bursty source tends to have large bursts of arrivals in a short 
period of time. However, since we only consider the source which is i.i.d. 
over time, we use this definition of burstiness. 

'"it can be easily shown that the burstiness of this CPE process, as defined 
in Section lItR2] is ^[^^^y 
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packet size, some CPE processes may or may not be g- 
smoothly-scaling. 

Example 2 ( g-smoothly-scaling CPE process): For g E G 
and /i > 0, consider a CPE process A^^^ with packet arrival 



rate fiXg{N) and average packet size 



N 



This family of 



processes is ij-smoothly-scaling because, using (fTOl i, we have 

MO) 



lim 

Af— >oo 



logi?[e N ^1 J I ^ — 



^ ^_ _ < 

giN) loo, otherwise, 

(11) 

which satisfies the conditions in the definition of g-smoothly- 
scaling. Since we will use this particular g-smoothly-scaling 
CPE process for examples in the paper, we denote it as 
CPE{X, ^, g, N). It is useful to note a particular case when 
g{N) grows linearly with N. Using a property of the Poisson 
process [29], this particular scaling case can be viewed as 
aggregating an increasing number of Poisson traffic streams 
(this number grows linearly with N), with each stream having 
the same packet length distribution. 

To complete our discussion on smoothly-scaling processes, 
we give an example below of a family of CPE arrival processes 
which is not g-smoothly-scaling. 

Example 3: A family of CPE processes where A^j.^'' has 
packet arrival rate /zA and average packet size N/ ^ (note the 
dependence on N only in the average packet size) is not g- 
smoothly-scaling for any g e Q. This is because, using ( fTOl l. 
we have 



lim 



log£;[e- 



9iN) 



0, o<o, 

oo otherwise. 



which is not finite in the (open) neighborhood of 6* = 0. Hence, 
this family of processes is not g-smoothly-scaling. 

Remark 3: The scaling function, g, describes the way the 
source statistics scale with SNR. Example|2]describes the case 
of the compound Poisson process, where g can be identified 
as the function that specifies how the average packet arrival 
rate (/iAg(log SNR)) and the average packet size ( ^g'(°iof snr) ) 
scale with SNR. 

C. Performance Measure and System Objective 

The overall performance measure is the total probability 
of bit loss, Ptot{r,T), where loss can occur due to channel 
decoding error or the end-to-end delay violation. Specifically, 

Ptot(r,r) Peh(r,T) + (1 - Pch(r,T)) Pdeiay (r,T), (12) 

where Pchir, T) denotes the probability of decoding error due 
to channel outage and Pdeiay {f, T) denotes the probability of 
delay violation. We are interested in finding the high-SNR 
asymptotic approximation of Ptoi{r,T) as a function of r, 
T, SNR, D, as well as the source and channel statistics 
(including A and the source scaling function g). In the interest 
of brevity, we denote Ptot as a function of only r and T, 
the two parameters over which the performance will later be 
optimized. 

Since the high-SNR asymptotic expression of Pcb{r,T) is 
already given by the DMT in (O, what remains is to find the 
asymptotic expression for Pdeiay (^;P)j which is shown in the 
next section. 



III. Asymptotic Analysis of Probability of Delay 
Violation 

In this section, we derive the asymptotic probability of delay 
violation PiAay{f,T) for the channel multiplexing rate r and 
coding block size T. We observe that the adopted block coding 
forces the queue to have a batch service that occurs every T 
timeslots with the instantaneous removal of the oldest rNT 
bits. The decay rate of the asymptotic tail probability of the 
sum arrival process, given in Proposition[Tl in conjunction with 
an asymptotic analysis of a queue with deterministic batch 
service, gives the following result: 

Lemma 2: Given g S fj, T £ T, r > A, a batch service of 
rNT every T timeslots, and a g-smoothly-scaling bit-arrival 
process characterized by the limiting g-scaled log moment 
generation function A, the decay rate of Pdeiay ('^jP) is given 
by the function /, i.e.. 



1 



lim , , 
N^oo g[N) 



l0gPdelay(r,T)- -/(r,T), 



(13) 



where 

I{r,T) 



tG 



7+. 



(^T + T - 1 - fc)A* 



tT+T-l-k>0 



(D + \- 2T)r 
tT+T-l-k 

(14) 



for k = D(modT). In addition, I{r,T) is lower- 
semicontinuous and increasing on r. 

Proof: See Appendix HH ■ 
Approximation 1: Relaxing the integer constraint in ( [141 ) 
gives the lower bound of / as 



where 



I{r,T)>5rr{D + l-2T) =:/,,(r,T), (15) 



sup{6l > : A(6l) < Or}. (16) 



We use this lower bound as an approximation to / as well, 
i.e., 

/(r, T) « I„{r, T) = SMD + 1 - 2r). (17) 
Proof: See Appendix HH ■ 
Example 4: For a g-smoothly-scaling CPE(A, /i, g, iV) bit- 
arrival process, the function / in (fT4] i can be calculated exactly 
with the following A*: 



a*(.t) = /i(v^- Va)', 



(18) 



However, an approximation of / in ( [TtI i is simpler to work 
with and given as 

/(r, T) « /„. (r, T) ^ ^i{r - \){D + 1 - 2T) , (19) 

where, using ( fTSI l and (fTTI ). Sr is given as 

A\ 

(20) 



Sr = f-l [ I — 



We will see via numerical examples in Section |V- A. 1 1 that the 
approximation in ( fT9] l is sufficient for our purpose. 
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IV. Main Result: Optimal Asymptotic Total 
Probability of Error 

In this section, we present the main result of the paper which 
states the optimal decay rate of the high-SNR asymptotic total 
probability of bit error. Recall the definition of Ptot from ( fTSl i: 

P,o,(r,T) := Peh(r,T) + (1 - Pch(r, T))Pdelay(r, T), 

where we now know that 



-dch(r,T) 



and 



Pdelay(r,r)^e-^('-^^)^'('°S''). 



Hence, the asymptotic optimal decay behavior of Ptot depends 
on the function g. The following theorem gives the main result 
of the paper. 

Theorem 1: Consider g E G and a g-smoothly-scaling bit- 
arrival process. The optimal rate of decay of the asymptotic 
probability of total bit error, maximized over all r e (A, rmax) 
and T gT, and the optimizing r* and T* are given, depending 
on the tail behavior of the function g, as follows: 



TV 



Case 1 : If jim ^ =76 (0,oo), then 



sup lim 

re(A,r„,„) P^°° 



logP,ot(r,r) 



logp 

dch(r*,T*) = 7/(r*,r*), 



(21) 



rate in Pch{i,T) and Pdeiay('', T). Hence, for Case 1, the optimal 
asymptotic total probability of error decays as follows: 

Ptot(r*,r*) = Pdeiay(r*,T*) = Pch(r*,r*) = p-'''. 

Note that d* is nothing but the optimal negative SNR exponent. 
In Case 2, where the source burstiness scales down slower 



than 9( ) but faster than e(- 



= ), we have that 



where 



/log p ' ' ""^ Vlog logp'^ 

-PtotC?", P) is asymptotically equal to Pdeiay(f, P) for all r e 
(A, fmax) and P e T. In this case, the decay rate of Pxoi{r, P) 
is equal to I{r,T). In other words, the channel error (outage) 
probability is dominated by the delay violation probability and, 
hence, can be ignored. 

Finally, in Case 3, when the source burstiness scales down 
faster than Q{-^^=), we have the opposite of Case 2. In Case 
3, the delay violation probability is dominated by the channel 
error probability and, hence, can be ignored. 



A. Approximation of the Optimal Negative SNR Exponent 

For Case 1 in Theorem [T] we use the following approxi- 
mation which is an immediate result of relaxing the integer- 
constrained optimizations of / and P* to obtain approximated 
expressions with much simpler forms. These approximations 
become especially useful in Section [Vl 

Approximation 2: Relaxing the integer constraints in the 
calculation of / (as in Approximation [T]) and P* in ( [23] ) gives 
the following "integer-relaxed" approximations for d* ,r* , and 



r*(P) 
T* 



inf{r e (A, w) : iHr, T) = deh(r, P)K22) 
argmax I{r*{T),T) (23) 

r*{T*). (24) 



Case 2 : If lim 



9{N) _ 



N 



= and lim 

TV— o< 



g{N) _ 



log AT 



= 00, then 



1- -log Ptot (r,P) ^ ^, 

sup iim < max 1 (r„ 

TeT 



,T). (25) 



Case 3: If lim ^^4P- = 00, then 



N- 



N 



r -log Ptot (r,P) ^ ^ , , 
sup lim < "ch I A, 

r-e(A,r„,„), P~*°° lOgp 

TeT 

Proof: See Appendix Hill 



(26) 



Theorem [T] shows that the optimal decay behavior of the 
asymptotic total probability of error depends on the tail 
behavior of the function g. As discussed earlier, the burstiness 
of the g-smoothly-scaling arrival process scales down as 



e( 



=). Below, we discuss each case of Theorem[Tl with 



VsOogp) ' 

respect to the scaling of the source burstiness: 

In Case 1, where the source burstiness scales down with 
0(--^===), both components of the probability of error decay 
exponentially with log p. In this setting, one can optimize the 
choices of r and P to arrive at a non-trivial optimal decay rate 
d* . The optimal r* and P* balance and minimize the decay 



:=rfch(CP*,), (27) 
P*«i;;, andr*«rr„ 

where, for Sr given in ( fT6l ) and any TeT, 

rliT) min{r G (A,rn,ax) : dch(r,P) - j5rr{D-2T+l)}, 

(28) 

and 



mill < T G 



— (deh(r*.(T),P))=0 



(29) 
(30) 



V. Applications of the Result 



In this section, we apply the result of Case 1 in Theo- 
rem [T] to analyze and optimize the end-to-end error probability 
of systems communicating delay-sensitive and bursty traffic 
over three outage-limited channels: SISO Rayleigh fast-fading 
channel, quasi-static cooperative relay channel, and quasi- 
static MIMO channel. 

To illustrate the methodology, we restrict our attention to the 
case of CPE(A, /i, g, logp) arrival process where ^(logp) = 
logp, for simplicity. Note that to better gain insights, we 
use the integer-relaxed approximations obtained in Approx- 
imation |2] 
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A. SISO Rayleigh Fast-Fading Channel 

Our first example considers an example of SISO Rayleigh 
fast-fading channel, whose dch{r,T) = T(l — r) (see (HI). 
Combining this with ( l20t and ( |28] | gives the optimal choice of 
multiplexing gain when the coding duration is fixed at T as 

C(r) = A + — i^^-^. (31) 



1 



fi{D + l-2T} 



In addition, using (29i . the integer-relaxed approximated op- 
timal coding duration can be expressed as 



T* 



1 



D + l 



1 



1 



(32) 



Inserting T*^ into ( [3T1 ). we get the approximated optimal 
channel multiplexing gain as 

.(LfJ) 



(33) 



.(1) 



Also, from ( |27] i. the approximated optimal negative SNR 
exponent is given as: 



,* _ rp* (T _ J,* ] 

(1 + ^)^ 2 



im(w.(L*j)) 



(l-A) 



(34) 

Below, we provide some observations of the above results: 
• The above result on d* can also be interpreted as a tradeoff 

which describes the relation between the normalized average 

arrival rate. 



A := lim (average bit-arrival rate) /N 



lim 

N^oo 



and the corresponding optimal negative SNR exponent d*j.{X) 
as a function of the delay bound D, and the average packet 
size 1/fi. For constant bit arrivals (CBR) at rate Alogp, i.e., 
mathematically when 0, any coding durations less 

than hallB of D (or more precisely [-jj) ^nd any channel 
multiplexing rates greater than A result in zero probability of 
delay violation. Hence, the optimal negative SNR exponent 
of the total error probability, denoted by d^^j^, is equal to 
the corresponding channel diversity when the optimal coding 
duration is at its maximum value, [-jj , and the channel 
multiplexing gain is at its minimum, A. That is, 

D 



(1-A). 



It is not surprising that this coincides with the classical DMT. 
With traffic burstiness, however, the optimal negative SNR 
exponent d*^{X) given in (l34l i is smaller than dQgj^{X). The 
ratio , ^ 

d*rW 1 



< 1 



can be interpreted as the reduction factor on the SNR exponent 
in the presence of burstiness. Figure |2] shows the impact of 
traffic burstiness (which is parameterized by yu) on d*^{X). 

"The first half of D is spent waiting for the next coding blocic and the 
other half waiting to be decoded at the end of the block. 



3 
2 



- M 

,,,+ 



- B 



■deterministic, D=1 1 
■H=.1, D=11 

H=.01, D=11 

H=.01 , D=300 
■H=.001, D=3000 



J = 64 



T = 2 
T*="i " 



-X- - . 

+ 



0.2 



0.4 



* : 



' - »« — . 



0.6 



0.8 



Fig. 2. SISO, Rayleigh fast-fading, coherent channel. The solid line describes 
the DMT (r = A). The dashed and dotted lines describe d*^(X) for various 
fi and D. 



• From a coding point of view, T*, is independent of the 
average bit-arrival rate A. This implies that for a fixed value 
of the average packet size l/fi, the optimal negative SNR 
exponent is achieved by a fixed-duration 1 x T*, code. Optimal 
codes for this setting exist for all values of r and T ([20], [21], 
[28]). On the other hand, if T is already given, the performance 
is optimized when the coding multiplexing gain is chosen as 
in (EB, i e.. 



.(T) = A 



1-A 



1 



fi{D + l-2T) 



• Since r^^^ = 1 for this SISO channel, we can verify that 
r*^ r^ax for very bursty traffic (i.e., ^ oo). That is 
for very bursty traffic the channel should operate close to its 
highest possible rate, which is the channel ergodic capacity. 

1) Numerical Comparison of the Approximation: Before 
we move to the next example, we illustrate numerically that 
the approximations in (|32]|-(|34]| well approximate their actual 
values in Theorem [T] In Figure |3] we show an example of 
a comparison at = 100 and various values of D and 
A. We observe that the approximated values match well with 
the exact values if D is sufficiently large. The matching is 
remarkably good for d* and d*^. Note that r*^ is independent 
of D, except when D is so small that T*^ = 1. 



B. Cooperative Wireless Networking with Optimal Clustering 

As studied in [30], [31], we consider communicating bursty 
and delay-limited information from an information source in 
a cooperative wireless relay network, shown in Figure IH 
where the diversity benefit of user cooperation is due to 
encoding across space and time [26], [32]. In the absence of 
delay limitation, having more cooperative users almost always 
improves performance. This is not the case, though, when 
one considers burstiness and delay QoS requirement. Take 
for example a network where the information-source node 
cooperates with v relays, under an orthogonal amplify-and- 
forward (OAF) cooperative diversity scheme and half-duplex 
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(a) d* and d* vs D and A 




?. = 0.1,0.4, and 0.7 



5 25 45 65 85 105 125 145 165 185 
D 

(b) T* and T* vs D and A 




+ r 



X = 0.7 



X = OA 



+ , X = 0.1 ,+ . 



5 25 45 65 85 105 125 145 165 185 
D 

(c) r* and r* vs D and A 

Fig. 3. Comparisons of the exact values d*, T* , and r* (3^, [3}j, [H:, 
respectively) and the integer-relaxed approximations d*^, T*^, and r*^., at 
various D and A. The dotted lines with markers correspond to the exact 
solutions while the solid lines represent the approximated solutions. 

constraint. This cooperation scheme gives the DMT: 

CW = (« + l)(l-2r). 

Note that rmax = 1/2 under this protocol. To realize this 
amount of diversity, the coding duration T is required to be 
at least 2{v + 1) channel uses or timeslots. This means that, 
in spite of the increase in the negative SNR exponent of the 
probability of decoding error with the number of cooperative 
relays, relaying over all nodes in the network might not be 
desirable as it increases the delay violations. Applying the 



result of Approximation |2] to CPE source and the above 
(i^[J°''(r) with T = 2(v + 1), the optimal performance is 
achieved when the nodes cooperate in clusters with 



D + l 



4(1 



1 



relays and transmit at multiplexing rate, 

1 h ^ ^ 
2 



1 + ^ 

Note that v*^ is independent of the traffic average arrival rate 
A. This means that meeting the optimal tradeoff for various 
values of A does not require modifying the cluster sizes, unless 
the traffic burstiness (parameterized by the average packet size 
changes. 



Cooperative 
Cluster with 
V* nodes 




Destination 
Node 



Node V 



Fig. 4. Snapshot of a wireless network, where the source node utilizes a 
subset of its peers (nodes 1, 2, . . . , d*) as relays for communicating with the 
destination. 

C. MIMO Quasi-Static Communications 

In the case of the MIMO Rayleigh fading channel with rit 
transmit and rir receive antennas, and with complete channel 
state information at the receiver (CSIR) and no CSI at the 
transmitter, the channel diversity gain (ich(j') is shown (see 
[1]) to be a piecewise linear function that connects points 

(fc, {uf — k){nj. — k)), = 0, 1, . . . , min(nj, n^). (35) 

The entire tradeoff is met \f T > rit [19]. An example of the 
effect of burstiness is shown in Figure |5] for the case of the 
2x2 Rayleigh fading channel {nt = Ur = 2). By assuming 
that T is given (not an optimizing parameter) and equal to 
2, the optimal multiplexing gain r* , which balances the SNR 
exponents of the probabilities of delay violation and decoding 
error, is the solution to dch{r*) = I{r* ,T = 2). Using the 
approximation ( fT9] l of / for CPE source, the approximation 
is the solution to 

dch(O-Mrr,-A)(i?-3) = 0, 

where dch is the piecewise linear function connecting points 
in ( |35] ). In other words, r*^ is given as 



' \ -X- 2-A 

+ 1+m(D-3)' 

A + 



3+A<(£i-3) ' 
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Figure |5] shows the resulting d*{X) 
values of burstiness fi and D. 



dch{r*r) for various 



D=10, no burstiness 
D=10, n=1 
D=10, n=1/10 




Fig. 5. MIMO, quasi-static, coherent 2x2 channel, d* v.s. A for different 
values of D and fi. 



VI. Summary and Future Work 

This work offers a high-SNR asymptotic error performance 
analysis for communications of delay-limited and bursty in- 
formation over an outage-limited channel, where errors oc- 
cur either due to delay or due to erroneous decoding. The 
analysis focuses on the case where there is no CSIT and 
no feedback, and on the static case of fixed rate and fixed 
length of coding blocks. This joint queue-channel analysis 
is performed in the asymptotic regime of high-SNR and in 
the assumption of smoothly scaling (with SNR) bit-arrival 
processes. The analysis provides closed-form expressions for 
the error performance, as a function of the channel and source 
statistics. These expressions identify the scaling regime of the 
source and channel statistics in which either delay or decoding 
errors are the dominant cause of errors, and the scaling regime 
in which a prudent choice of the coding duration and rate 
manages to balance and minimize these errors. That is, in 
this latter regime, such optimal choice manages to balance 
the effect of channel atypicality and burstiness atypicality. To 
illustrate the results, we provide different examples that apply 
the results in different communication settings. We emphasize 
that the results hold for any coding duration and delay bound. 

Many interesting extensions of the current work remain. 
One example is the high-SNR analysis of systems with re- 
transmission mechanism and/or adaptive adjustment of the 
transmission rate and coding duration as a function of the 
current queue length at the transmitter With retransmission, 
the diversity of the channel can be improved considerably [33] 
but at the cost of longer and random transmission delays. 
On the other hand, we may be able to improve the system 
performance by adjusting the transmission rate according to 
the need of the bits in the queue. For example, when the 
queue length is short, we may reduce the transmission rate, 
which improves the probability of channel decoding error but 
possibly at the cost of longer delays of the bits that arrive later. 
However, since in high-SNR analysis the probability of error 



is asymptotically dominated by the worst case probability, it is 
not clear whether such adaptive transmission rate mechanism 
will improve the asymptotic decay rate of the probability of 
bit error. 

In addition, this work focuses on the notion of SNR 
error exponent as a measure of performance. This view of 
communication systems provides a tractable and intuitive 
characterization of various suggested schemes in the high-SNR 
regime. It would also be interesting to fine-tune the high-SNR 
asymptotic analysis presented here, for the regime of finite 
SNR, as well as extend it to different families of bit-arrival 
processes. 

Appendix I 
Proof of Proposition[T] 

Proposition |7} Consider a g-smoothly-scaling process 
with the limiting ^-scaled log moment generation function A. 



Let = for t € N. Then, for a > Xt, we have 



.{N 



lim 



N- 



9iN) 



\ogP 



5, 



{N) 



N 



> a 



-tA*{a/t), (36) 



where A* is the convex conjugate of A. 

Proof: Let n = g{N) and f/"^ = ^^i^'- From Q 
and the property of A for the g-smoothly-scaling process, we 
have 



Ay, (6*) lim -log£;[e' 

n— i-cx3 Ji 



ev} 



which exists for each G M as an extended real number and 
is finite in a neighborhood of 6' = 0, essentially smooth, 
and lower-semicontinuous. Then, the Gartner-Ellis theorem 
(Theorem 2.11 in [14]) shows that y/"^/n (which, in this 
case, is equivalent to S^^^ /N) satisfies the large deviations 
principle (LDP) in R with good convex rate function 



Ay (x) := sup 9x 



Ay (9) = sup 9x 



tA{9) = tA*{x/t). 



For a > E[-j^] = Xt, the LDP result gives the assertion of 
the proposition (see Lemma 2.6 and Theorem 2.8 in [14]). ■ 

Appendix II 

Proof of Results on the Asymptotic Probability 
OF Delay Violation 

Lemma |2} Given gEQ,TET, r>X, a batch service of 
rNT every T timeslots, and a g-smoothly-scaling bit-arrival 
process characterized by the limiting g-scaled log moment 
generation function A, the decay rate of Pdeiay('', 2^) is given 
by the function /, i.e., 

lim ^logPdeiay(r-,T) = -/(r,r) (37) 



where 

I{r,T) 



tT+T-l-k>0 



{tT + T-l- k)A* 



{D + l- 2T)r 



tT+T-1- 



k 
(38) 
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and k = D{modT). In addition, I{r,T) is lower- 
semicontinuous and increasing on r. 

Proof: Let 5 e T e T = {1, 2, . . . , [-^J }, r > A, and 
k = _D(mod T). Without loss of generality, we assume that 
/(r,r) < oo. 

For any given SNR p and N = logp, there are ^j^' bits 
arriving at time t. The queue is being served exactly at times 
mT, for m £ Z, with an instantaneous removal of the oldest 
RT = rNT bits. The corresponding queue dynamics for the 
queue size Qi ' , 



at time t, are as follows. 

+ 



TR , if t = raT, m G Z, 
otherwise, 

(39) 

where = 0. Since the arrival process is stationary and 

the system started empty at time — oo, then Q,-^'' has the same 
steady-state distribution as that of m G Z, for each 

i = 0, ...,T — 1. The delay at time i also has the same 
steady-state distribution as the delay at time mT + i. Since 
P^eUyir, T), as a function of r, T, is defined as the probability 
of the steady-state delay being greater than D, we have 

-Pdelay(?', T) 

P(steady-state delay of a bit > D) 



T-l 



s-s delay of a bit arriving at time i > D) 



(40) 

where the equality holds since the arrivals are independent 
across time. From Lemma |4] in Appendix |V] we have that 
the delay violation probability of any bit arriving at time i is 
asymptotically equal to the delay violation probability of the 
last bit arriving at time i, ( l40b becomes 



f delay T) 



where qI^'' denotes the event that the last bit arriving at 
timeslot i violates the delay bound D. This holds because 
T is a constant independent of p. Hence, dTTT i says that Pdeiay 
is asymptotically equal to the sum of P(Q^^''). 

Next, we relate the event Q^^' to a condition on the queue 
length Q\^\ for i = 0, . . . , T — 1. To do this, we need to 
describe the condition that the delay of the last bit arriving 
at timeslot i violates the delay bound D. Upon arrival, the 
last bit sees qI^' bits (including itself) waiting in the queue. 
Since the batch service happens exactly in multiples of T, 
the bit must wait T — i timeslots for the next service to start 



i=0 



T-1 



E^(2ra (41) 



1=0 



and another 



BT 



T timeslots for all Q,-^'' bits (including 

the last bit) to get served and be decoded. Hence, the last bit 
arriving at time i violates the delay bound D if, and only if. 



T-i 



or 

RT 



T > D. 



Let fi'^' contains all measurable random events. The condi- 
tion above implies that the delay violation event for the last 



bit is given as 



RT 



T > D}. (42) 



Using ( [39] l and (|42] |. we show in Lemma [3] of Appendix IIVI 
that 



Pdelay(r,r) ^ P(Q^^\_,.) ^ P{Q't^,^, > {D-T-k)R). 

(43) 

Intuitively, this means that Pdeiay {'r^ T) is asymptotically equal 
to P{Q!'jl}i_f.), equivalently Pie\-iy{r,T) is asymptotically 
equal to the probability that the last bit arriving at time T~ 1— k 
sees a queue length greater than {D— T— k)R bits. 
Finally, using ( |43] ), what remains is to establish that 



(TV) 



lim 



\ogP{Q^^\_, > {D-T-k)rN) 



9{N) 



-I{r,T) 



min {tT+T-k-l)A* { r 

tT+T-l-k>0 



(D + l- 2T)r 



tT+T-k-1 



(44) 



For notational simplicity, let i := T — 1 — fc and q := (D — 
T-k)r. Note that q> ri>{) since T G {1, 2, . . . , [-|J } and 
k — D{ mod T). Now, since 

q + rTt _ {D + l- 2T)r 
Tt + i ^ ^ tT + T-k-1' 

it is sufficient to show that 



lim 



logF(g.|^^ >Nq) 
9{N) 



mm {Tt + i)K ( . ). 

tez+: 11 + I 

tT+i>Q 

(45) 

We separately show (matching) upper and lower bounds. 

First, we show the lower bound. By using the queue 
dynamics in ( [39] l recursively and the assumption of Q^f^ — 0, 
the queue length Q^^' is related to the arrivals j < i, 

in the following manner: 



sup 



J2 ~ ^^^^ ' (46) 



VJ = -tT+l 



where we use the convention that X^j^i ^j^'* = 0- Using this 
relation and the fact that g > 0, we have 



P{Q\'^>>Nq) = P sup J2 Af^-rtTN > Nq 



P 



sup ^ Af^-rtTN>Nq 

\tT+i>0-' 
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Now, for any fixed t e Z+ so that tT + i > Q, we have 
P(q|^^ > TVg) > P I ^ aJ.^^ - riTiV > Nq 

\j = -tT+l 

(tT+i 
^ Af^ > N{q + rTt) 

( c(^) 

Orj 



= P 



-'Tt-\ 



N 



> q + rTt 



Taking the limit of both sides and using Proposition [T] we 
have 

„, logP(Q|"; >«,) ^ „ ^ ,-,A.<2_tlI!,^ ,47, 

Since t is arbitrary, maximizing the RHS over t gives the 
appropriate lower bound: 

tT+i>0 

(48) 

For the upper bound, we use the following result from 
Lemma |5] in Appendix I VII 

log P (Q\^^ > Nq) , , q + rTt 

limsup ^ — < - inf {Tt+i)A* ' ^ 



tG 



7+. 



tT+i>0 



noting that the RHS is strictly greater than — oo, by assump- 
tion. Hence, the lower and upper bounds coincide and (l45T l 
holds. 

To complete the proof, we show the properties of I{r, T) 
for T G T. First, / is increasing on r > A because A*(a;) is 
increasing on x > A (Lemma 2.7 in [14]). Second, I{r,T) 
is lower-semicontinuous on r because / is the minimum of 
a number of function A* which are lower-semicontinuous 
(Lemma 2.7 in [14]). ■ 

Approximation Q} Relaxing the integer constraint in ( fT4l i 
gives the lower bound of / as 



I{r,T) > S,.r{D + 1 - 2T) =: /,^(r,T), (49) 



where 



(50) 



5r = snp{e > : A{e) < Or}. 
Proof: By the definition of /, we have 

I{r,T) 

D — 2T + 1 

= min (tT+r-l-fc)A*(r + r ) 
tez+: tT+T-l-k' 

tT+T-l-fe>0 

^ ■ AW i:'-2r + i- 

> mm tA (r + r I 

rGR+ T ' 

= ,5^r(i:'-2T+l), 

where the last equality is a result of Lemma 3.4 of [14] with 
Sr defined as in ( fSOl l. ■ 



Appendix III 
Proof of the Main Result 

Proof: [Proof of Theorem [T] Recall that: 

F,o,(r,r) Pch(r,T) + (1 - Pch(r-, r))Fdeiay(r, T), (51) 
where, from (|2]l, 



Pch (r,T) 



-dch(r,T) 



and, from Lemma 121 



Pdelay(r,T)^e-^('--^)f('°*5''). 

Case 1 : when lim = 7 e (0, 00). We have 



delay 



and 



The optimal negative SNR exponent of Ptot is 

log PUr, T Y 
logp 



(52) 
(53) 

(54) 
(55) 



d* := sup < lim - 

tgt 



sup {miii{7/(r,T), dai{r,T))) 



= max <; sup {mill {7/(7", r), dch(f, P)}} > • (56) 



re(A,r„,„) 



We first solve the optimization sub-problem within the bracket 
for any given integer TeT. Because /(r, T) is increasing on 
r > A while dchl'^i is strictly decreasing on r G [0, r^^y\, the 
sub-problem is solved by the optimal choice of multiplexing 
gain when the coding duration is fixed at T as 

r*{T) inf{r G (A, w) : 7/(r,r) = d,,,[T,T)}. (57) 

Hence, ( |56] l is solved with the optimal coding duration T*, 
given as 

T* = argmax -fl{r* {T),T), 

and the optimal multiplexing gain r* , given as 

r* = r* (T*). 

Note that, since I{r,T) > when r > A and dch{r,T) > 
when r < r^ax, it is guaranteed that r*{T) G (A,rn,ax)- 



Case 2: when lim 



9{N) _ 



= and lim 



9(N) 



00. In this 



case, for all r G (A, rmax) and all T G T, we have Pdeiay('', T) 
asymptotically dominates Pch(r, T) and hence Ptot(?', T) is 
asymptotically equal to PdeiayC?", P)- Since, for any P G T, 
J(r, P) is increasing on r > A, we have 

r -log -Ptot (r,P) ^ / ^, ^,1 

sup iim Y^ ^ S max < sup i[r, 1 ) > 



re(A,r„,„), P-'"" 5(l0gp) 

TeT 



TeT 



i-e(A,r„,„) 



Case 3 : when lim 

AT— >oo 



N 



= max I{r^^x,T). 
TeT 

00. This case is an opposite of 



Case 2. Here, Ptot(?', P) is asymptotically equal to Pch{r, P) for 
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all r e (A, j'max) and all T G T. Since dch{r,T) is decreasing 
on r and increasing on T, we have 

r -log-Ptot(r,T) ^ / . / 

sup lim < max < sup dchir, 1 ) > 

re{A,r„,„), ^°SP |re(A,r„,„) J 

= max d^h(X, T) 



dch(A, 



Appendix IV 
Proof of Lemma [3] 

In this appendix, we prove the following lemma which is 
used in Appendix HH 

Lemma 3: Consider g e G, T G T = {!,..., [f J}, 
r > A, a family of g-smoothly-scaling bit-arrival processes 
characterized by the limiting g-scaled log moment generation 
function A, and a periodic batch service of rNT bits at 
timeslots mT, m e Z. Let Q\^^ be the queue length at time 
i e {0, . . . , T - 1}. Then, the event q!'t},,_^, defined as 



-T-fc- 



= i G r2^^' : fc + 1 + 



RT 



T > D 



with k ~ £'(mod T), asymptotically dominates Pdeiay(?', r). 
In other words, 

Pdeiay(r, T) ^ Vy{Q^^\_^ > {D - T - k)r\ogp). (58) 
Proof: Let k = D{ mod T) and i e {0, . . . , T - 1}. 
Recall from (HI) that 



RT 



T > D}. 



Now using the observation that, for any x, G K 
we have 



Q 



{N) 



U) . — — > 



RT 



D + i-T 
T 



{to) > {D-T-k)R}, ie[0,T-k-l] 



[{u;:Q\"\u;)>{D-k)R}, 
On the other hand, dTTT i implies that 



i e [T-k,T-l]. 

(59) 



T-l 



(JV)^ 



i=0 



T-l 



4=0 



=T-k 



(a) 



i^max{P(Q(.^_t,),P(Q^:\)} 



= max{P(Q^^\._i > {D-T-k)R),P{Q)j:\ > {D-k)R)} 



(N) 



P{Q'^rX-i>{D-T~k)R), 



(60) 



where the equality in (b) is from ( |59] l. Next, we establish the 
(asymptotic) equalities (a) and (c). For (a), we first need to 
show that 



"E'VcQf )^F(Qf.L,). 



(61) 



To establish this, we first observe that 

Qf\.) = Qf\u)+A^il\{u:) + ... + A<f\.) > Qf 



>o 



(62) 

for all uj e and < i < j < T - L Hence, from 

we have 



P{Q^T-k-i) > PiQ^"'): ^ e {0, . . . , T - fc - 1}, 

which implies 

T-fc-l 

E ^(Qf^)<(r-fc-)P(Q^^Li)- (63) 



On the other hand, from the non-negativity of probability, we 
have 

T-fc-l 



(64) 



i=0 



Combining ( l63T l and (|64] |. we have (1611 1. Similarly, we can 
show that 

T-l 



(65) 



Combining dMT l and ( |65] l, equality (a) in ( |60l l is established. 
To establish equality (c), it is sufficient to show that 

PiQ^^^>D'R) < P{Qf^>D'R) < PiQ^^'^ > {D'-T)R), 

(66) 

for any D' > T and j E {0, . . . , T — 1}. This is because for 

ji = T - 1 and D[ = D - k, we get 

P{Q^rJ, >{D- k)R) < P(q('^' >iD-T- k)R), 
while for = T- fc- 1 and D!^ = D- T- fc, we get 
P(q[,'^' > {D-T-k)R) <P{Q^T}k-i > {D-T-k)R), 
asserting (c). 

We prove (|66] | in two steps. The lower bound directly 
follows from (|62]|. i.e.. 



For the upper bound, we notice that, for D' > T and 



c 



q'j^\uj) is related to Qlf '(uj) as 



= gf ) {cj) + A)l[ {cj) + ■ ■ ■ + A'^" {cj) - TR, 
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where [•]+ is removed. As a result, we have 

P{Qf^ > D'R) 



[A%\+... + A':r''^}+TR>D'R) 



< P{Q^^^ > [D' - T)R) 
= P(Qr^ > {D' - T)R), 



where the last equality holds since and Qo^'' have the 

same stationary distribution. ■ 



Appendix V 
Proof of Lemma|4] 

This appendix shows that the average probability of delay 
violation for bits that arrive at time i is asymptotically equal 
to the corresponding probability for the last bit arriving at 
that time. The proof is mainly based on the definition of the 
5-smoothly-scaling process. 

Lemma 4: Consider g G G and a family of 9-smoothly- 
scaling bit-arrival processes ((4^\t G Z),N G N), char- 
acterized by the limiting g-scaled log moment generation 
function A. For any given N, let be a random variable 

having the same distribution as the steady-state distribution 
of the delay of a randomly chosen bit that arrives at time 
i G {0, . . . , T — 1} while Z^^^^ is a random variable having a 
distribution that is identical to the steady-state distribution of 
the delay for the last bit that arrives during time i. Then, for 
any D > 0, 

p(^(^) > D)^ PiZ^'^^ > D). (67) 
Proof: We show ( |67] | by showing the upper bound: 



P(H/(W) > D)< P{Z^^^ > D) 
and the lower bound: 

> D)> PiZ^'^^ > D). 



(68) 



(69) 



The upper bound is an immediate consequence of M^(^) (w) < 
forcj G Below we prove the lower bound. We 

have 



(70) 

Now, given that A^^'' — a bits arrive at time i, we index the a 
bits as bit 1 to a, where bit 1 arrives first and bit a arrives last. 
Given Af^'^ = a, we let W,-^'' to be the steady-state delay of 
the j-th bit, j G {1, . . . , a}. Since the bit can have any index, 
from 1 to a, with equal probability of 1 /a, we have 

P(W^W > ) = ^ E >D\4''^ = a). 

Ignoring all but the last term in the sum, we have 

= ip(zW >i?|4W=a), 
a 



where the equality is a result of how Z^^^ is defined. This 
means that 

P{W^^^>D) > Y.^P{Z^'''>>D\4''^=a)P{4''^=a) 
= ^ip(zW>i5and Af'=a). 

a(EN 

Now, for a given /3 > 0, define 

PW :={feGN:6<e''sW}. 
We can further lower bound P(VF'^^ > D) as follows: 

> D) 

> ^ ip(zW > and ^ = a) 
>e-^5W P{Z(^^ > D and 4''^ ^ a) 

= e-^«(^)p(Z(^) > D and A^^ G P*^'), (71) 

where the second inequality holds because 1/a > e~l^9(N) for 
any a G P^^'. 

Next, we show that P(A^^' G P^^^) ^ 1 as iV ^ 00. 
We do this by using the definition of the g-smoothly-scaling 
process: there exists 9 > Q such that 

lim ^ i = Ai6) < 00. 

Af-^oo g{N) ^ ' 

Hence, for any e > 0, there exists = -/Vo(e) such that for 
all > A^O; we have 

g{N){k{e) + e) > logP[e^^'"'^?W/^]. (72) 

The RHS can be lower-bounded, for any ai G N: 

logP[e«^-"'«(~)/^] = log [Y^PiA'-P ^ a)e''"s(^)/^') 

VaGN / 

> log I E Pi^^P = a)e«'^»(^)/^ 

\ a>ai 



>log (P(^f ) > a^e^^^^W/^ 
= 0«,^+logP(AW>ai). 



A^ 



This together with (|72] | gives 



logP(Af ^ > < g(Ar)[A(0) + ^ - 
for all fli G N. Now, we select a\ = e^^'^^'' to get 
log (1 - P(Af ^ G P(^))) = logP(Af ) > e/^st^)) 



6*01- 



<5W[A(0) + e 



Since lim = 00, we, then, have 



(^) c r(^) 



p Ur' gp 



(73) 
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Finally, combining ( TTSl l and (fTTl ) implies that, for any /3 > 0, 

logP(W^W>D) logP(zW>D) ^ 

iim > 1™ TTT^ P- 

Since /3 can be chosen arbitrarily small, we have the lower 
bound in f69[ . hence the assertion of the lemma. ■ 



Appendix VI 
Proof of Lemma[5] 

In this appendix, we prove the following lemma which is 
used in Appendix HH 

Lemma 5: Consider g e G, T e T = {1,...,[-|J}, 
r > A, a family of g-smoothly-scaling bit-arrival processes 
characterized by the limiting g-scaled log moment generation 
function A, and a periodic batch service of rNT bits at 
timeslots mT, to G Z. Let be the queue length at time 



i G {0, . . . , T — 1}. Then, for q > ir, we have 
hmsup f— <- mf {Tt+i)A 

N^oo g{N) tGZ+: \ Tt + l 

tT+i>0 



q + rTt\ 



(74) 

assuming that the RHS is strictly greater than — oo. 

Proof: The proof uses the same technique as in [14, 
Lemma LIO and Lll]. Using ( l46b . we have the following 
bound: 



P{Q[^^>Nq)^P 



sup Af^~rtTN>Nq 

\tT+i>0->- 



= P I sup S[!^l^ - rtTN > Nq 



te: 



7+. 



ytT+i>0 

< E P{sl^l>N{q + rTt)] 

Now, for any fixed to E N, we have 

P(gW>7Vg)< J2 PiSlTl>Niq + rTt)) 

-^<t<to 

+ J2Pi^tTl>N{q + rTt)). (75) 

t>to 

Employing the principle of the largest terrr0 gives 



lim sup 



log P(gf) > Nq) 



9{N) 



log P{si^l>Niq + rTt)) 
9iN) 



< max ( max lim sup 

i<t<to AT— oo 



lim sup log Pisl^l > N{q + rTt))] ■ (76) 



t>to 



'^The principle of the largest term [14, Lemma 2.1]: Let an and bn 
be sequences in IR+. If n~^loga,i ~* a and n~^logfe„ b, then 
n^^ log(a„ + b„) —* max(a, b). This extends easily to finite sums. 



For the first term (the t < Iq term) in the maximum, we 
use Proposition [T] to get 



max lim sup ■ 



1 



■logP 



(N) 
tT+i 



±,<t<tQ N->oc g{N) ' ° \ ^ 

< max -{Tt + i)A* ( + ''^^ 
-j,<t<to \ Tt + i 

<- inf iTt + ^)A*^^-±^ 
tez+: \ Tt + i 

tT+i>0 



> q + rTt 



(77) 



which is the RHS of ( l74l i and finite by assumption. 

Now, we show that we can select to appropriately such that 
the second term (the t > to term) in the RHS of (|76] i is also 
no greater than the RHS of (|74] i. In other words, we show that 
there exists to such that 

lim sup log ^ P [Sl^l > Niq + rTt) 



N 



< 



t>to 



inf (Tt + i)A* 



tT+i>0 



q + rTt 
Tt + i 



(78) 



This is shown by proving that there exist some 6 > and 
e > such that 

lim sup -L^ log ^ P (^Sl^l > N{q + rTt)) 



' t>to 

<-e9{{to + l)T + i), 
for all to G I^- Now, selecting 

to 



(79) 



eOT tgz+: 

tT+i>0 



Tt + i 



provides dTSb . 

To prove ( |79l ), we first use Chernoff bound as follows: 

J2PiS^tTl>N (q + rTt)) 



t>ta 



t>to 



t>to 



t>to 

5^cxp(-g(iV)(tT + z) 



t>to 



q + rtT\ logE[e 



tT + i 



9iN) 



), (80) 



where 6 is an arbitrary positive scalar and the second equality 
is a consequence of i.i.d. assumption on A^^K 

Next, we use the convexity of A and the fact that A'(0) = 
\ < r (Remark |2]i to establish that there exist some > and 
e > for which 

A{9) <e{r -2e). (81) 
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On the other hand, from ([6]l, we know that 
'S.^ — 1 — 1 A (6*). This means that there exists 



a Nq = No{0, e) such that, for all N > Nq, 
\ogE[e- 



9iN) 

Combining this with dSTT i. we have 

8a(iV) 



< A{0} + ee. 



log£;[e- 



(82) 



9iN) 

for all iV > iVo- 

Hence, using (ISZt . the term inside the square bracket in (ISOl l 
can be bounded, uniformly over all t > to, as 

q + rtT\ \ogE[e 
tT + i 



9{N) 
log£'[e 



9iN) 



> Or - 

> Or - 
= 0e, 



\ogE[e- 



9{N) 
(r-e) 



(83) 



where the first equality holds because q > ir, by assumption. 
Inserting into we have ( |79] i: 



< lim sup ^777 log 



t>to 



lim sup 



cxp(-5(A^)(iT + i)6le) 



log 



t>to 

g-3(Ar)e£((io + l)T+j) 



1 _ g-g(W)eeT 

= -ee{{to + l)T + i), 
and, hence, the assertion of the lemma. ■ 
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